Cloud Classi cation Using Error-Correcting Output Codes
نویسندگان
چکیده
Novel arti cial intelligence methods are used to classify 16x16 pixel regions (obtained from Advanced Very High Resolution Radiometer (AVHRR) images) in terms of cloud type (e.g., stratus, cumulus, etc.). We previously reported that intelligent feature selection methods, combined with nearest neighbor classi ers, can dramatically improve classi cation accuracy on this task. Our subsequent analyses of the confusion matrices revealed that a small number of confusable classes (e.g., cirrus and cirrostratus) dominated the classi cation errors. We conjectured that, if the class labels in the data were re-represented so that these cloud classes are more easily distinguished, then additional accuracy gains might result. We explored this hypothesis by replacing each class label with a set of error-correcting output codes, a general technique applicable to any classi cation algorithm for tasks with at least three classes. Our initial results are promising; error correcting codes signi cantly increased classi cation accuracy compared with using standard representations for class labels. To our knowledge, this is the rst successful integration of k-nearest neighbor classi ers and error-correcting output codes (i.e., where k is, e ectively, small). One conclusion is that environmental scientists should always select, for their classi cation tasks, a classi er that reduces both variance and learning bias.
منابع مشابه
Sensitive Error Correcting Output Codes
Sensitive error correcting output codes are a reduction from cost sensitive classi cation to binary classi cation. They are a modi cation of error correcting output codes [3] which satisfy an additional property: regret for binary classi cation implies at most 2 l2 regret for cost-estimation. This has several implications: 1) Any 0/1 regret minimizing online algorithm is (via the reduction) a r...
متن کاملSensitive Error Correcting Output Codes
We present a reduction from cost sensitive classi cation to binary classi cation based on (a modi cation of) error correcting output codes. The reduction satis es the property that regret for binary classi cation implies l2-regret of at most 2 for cost-estimation. This has several implications: 1) Any regret-minimizing online algorithm for 0/1 loss is (via the reduction) a regret-minimizing onl...
متن کاملMulti-class Text Categorization with Error Correcting Codes
Automatic text categorization has become a vital topic in many applications. Imagine for example the automatic classi cation of Internet pages for a search engine database. The traditional 1-of-n output coding for classi cation scheme needs resources increasing linearly with the number of classes. A di erent solution uses an error correcting code, increasing in length with O(log2(n)) only. In t...
متن کاملWhy Error Correcting Output Coding Works
Previous research has shown that a technique called error correcting output coding ECOC can dramatically improve the classi cation accuracy of supervised learning algorithms that learn to classify data points into one of k classes This paper presents an empirical investigation of why the ECOC technique works particularly when employed with decision tree learning methods It concludes that an imp...
متن کاملAn approach to fault detection and correction in design of systems using of Turbo codes
We present an approach to design of fault tolerant computing systems. In this paper, a technique is employed that enable the combination of several codes, in order to obtain flexibility in the design of error correcting codes. Code combining techniques are very effective, which one of these codes are turbo codes. The Algorithm-based fault tolerance techniques that to detect errors rely on the c...
متن کامل